<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV="Content-Type" Content="text/html; charset=Windows-1252">
<title>Character Set Recognition</title>
<style>@import url(coUA.css);</style>
</HEAD>

<BODY>
<h3><a name="om40char_0001030404000000"></a>Character Set Recognition</h3>
<p>
Internet Explorer uses the character set specified for a document to determine how to translate the bytes in the document into characters on the screen or on paper. By default, Internet Explorer takes the character set from the charset parameter of the HTTP content type returned by the server. If the server does not supply this parameter, Internet Explorer uses the character set specified by the META element in the document. If the document has no such META element either, Internet Explorer falls back to the user's preferences. </p>
<p>
You can use the META element to explicitly set the character set for a document. In this case, you set the <a href="html0kog.htm#ie40html_ref_0001030101003502">HTTP-EQUIV=</a> attribute to &quot;Content-Type&quot; and specify a character set identifier in the <a href="html0kog.htm#ie40html_ref_0001030101003501">CONTENT=</a> attribute. For example, the following META element identifies Windows-1251 as the character set for the document. </p>
<p>
&lt;META HTTP-EQUIV=&quot;Content-Type&quot;</p>
<p>
  CONTENT=&quot;text/html; CHARSET=Windows-1251&quot;&gt;</p>
<p>
As long as you place the META element before the BODY element, it affects the whole document, including the TITLE element. For clarity, it should appear as the first element after HEAD so that all readers know the encoding before the first displayable element is parsed. Note that the META element applies only to the document containing it. This means, for example, that a compound document (a document consisting of two or more documents in a set of frames) can use different character sets in different frames. </p>
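<p>
The following is a minimal sketch of the placement described above: the META element is the first element inside HEAD, ahead of TITLE. The title and body text are placeholders chosen for illustration; only the META element itself comes from the earlier example. </p>
<pre>
&lt;HTML&gt;
&lt;HEAD&gt;
&lt;!-- META comes first, so the character set is known before TITLE is parsed --&gt;
&lt;META HTTP-EQUIV=&quot;Content-Type&quot; CONTENT=&quot;text/html; CHARSET=Windows-1251&quot;&gt;
&lt;TITLE&gt;Placeholder title&lt;/TITLE&gt;
&lt;/HEAD&gt;
&lt;BODY&gt;
&lt;P&gt;Placeholder body text.&lt;/P&gt;
&lt;/BODY&gt;
&lt;/HTML&gt;
</pre>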
<table>
<tr valign=top>
<td>
<b>Windows Codepage # </b></td>
<td>
<b>Display name </b></td>
<td>
<b>Preferred ID on SAVE </b></td>
<td>
<b>Aliases in Internet Explorer 4 </b></td>
</tr>
<tr valign=top>
<td>
1252 <BR>(See Note 1) </td>
<td>
Western </td>
<td>
iso-8859-1<BR>except when 128-159 is used, use &quot;Windows-1252&quot; </td>
<td>
iso-8859-1 </td>
</tr>
<tr valign=top>
<td>
28592 </td>
<td>
Central European (ISO) </td>
<td>
iso-8859-2 </td>
<td>
iso8859-2, iso-8859-2, iso_8859-2, latin2, iso_8859-2:1987, iso-ir-101, l2, csISOLatin2 </td>
</tr>
<tr valign=top>
<td>
1250 </td>
<td>
Central European (Windows) </td>
<td>
Windows-1250 </td>
<td>
Windows-1250, x-cp1250 </td>
</tr>
<tr valign=top>
<td>
1251 </td>
<td>
Cyrillic (Windows) </td>
<td>
Windows-1251 </td>
<td>
Windows-1251, x-cp1251 </td>
</tr>
<tr valign=top>
<td>
1253 </td>
<td>
Greek (Windows) </td>
<td>
Windows-1253 </td>
<td>
Windows-1253 </td>
</tr>
<tr valign=top>
<td>
1254 </td>
<td>
Turkish (Windows) </td>
<td>
Windows-1254 </td>
<td>
Windows-1254 </td>
</tr>
<tr valign=top>
<td>
932 </td>
<td>
Japanese (Shift-JIS) </td>
<td>
shift_jis </td>
<td>
shift_jis, x-sjis, ms_Kanji, csShiftJIS, x-ms-cp932 </td>
</tr>
<tr valign=top>
<td>
51932 </td>
<td>
Japanese (EUC) </td>
<td>
x-euc-jp </td>
<td>
Extended_UNIX_Code_Packed_Format_for_Japanese, csEUCPkdFmtJapanese, x-euc-jp, x-euc </td>
</tr>
<tr valign=top>
<td>
50220 </td>
<td>
Japanese (JIS) </td>
<td>
iso-2022-jp </td>
<td>
csISO2022JP, iso-2022-jp </td>
</tr>
<tr valign=top>
<td>
1257 </td>
<td>
Baltic (Windows) </td>
<td>
Windows-1257 </td>
<td>
windows-1257 </td>
</tr>
<tr valign=top>
<td>
950 </td>
<td>
Traditional Chinese (BIG5) </td>
<td>
big5 </td>
<td>
big5, csbig5, x-x-big5 </td>
</tr>
<tr valign=top>
<td>
936 </td>
<td>
Simplified Chinese (GB2312) </td>
<td>
gb2312 </td>
<td>
GB_2312-80, iso-ir-58, chinese, csISO58GB231280, csGB2312, gb2312 </td>
</tr>
<tr valign=top>
<td>
20866 </td>
<td>
Cyrillic (KOI8-R) </td>
<td>
koi8-r </td>
<td>
csKOI8R, koi8-r </td>
</tr>
<tr valign=top>
<td>
949 <BR>(See Note 2) </td>
<td>
Korean (KSC5601) </td>
<td>
ks_c_5601 </td>
<td>
euc-kr </td>
</tr>
<tr valign=top>
<td>
1255 (logical) <BR>(See Note 3) </td>
<td>
Hebrew (ISO-logical) </td>
<td>
Windows-1255 </td>
<td>
iso-8859-8i </td>
</tr>
<tr valign=top>
<td>
1255 (visual) </td>
<td>
Hebrew (ISO-Visual) </td>
<td>
iso-8859-8 </td>
<td>
ISO-8859-8 Visual, ISO-8859-8 , ISO_8859-8, visual </td>
</tr>
<tr valign=top>
<td>
862 </td>
<td>
Hebrew (DOS) </td>
<td>
dos-862 </td>
<td>
dos-862 </td>
</tr>
<tr valign=top>
<td>
1256 </td>
<td>
Arabic (Windows) </td>
<td>
Windows-1256 </td>
<td>
Windows-1256 </td>
</tr>
<tr valign=top>
<td>
720 </td>
<td>
Arabic (DOS) </td>
<td>
dos-720 </td>
<td>
dos-720 </td>
</tr>
<tr valign=top>
<td>
874 </td>
<td>
Thai </td>
<td>
Windows-874 </td>
<td>
Windows-874 </td>
</tr>
<tr valign=top>
<td>
1258 </td>
<td>
Vietnamese </td>
<td>
Windows-1258 </td>
<td>
Windows-1258 </td>
</tr>
<tr valign=top>
<td>
65001 </td>
<td>
Unicode UTF-8 </td>
<td>
UTF-8 </td>
<td>
UTF-8, unicode-1-1-utf-8, unicode-2-0-utf-8 </td>
</tr>
<tr valign=top>
<td>
65000 </td>
<td>
Unicode UTF-7 </td>
<td>
UNICODE-1-1-UTF-7 </td>
<td>
utf-7, UNICODE-1-1-UTF-7, csUnicode11UTF7 </td>
</tr>
<tr valign=top>
<td>
50225 </td>
<td>
Korean (ISO) </td>
<td>
ISO-2022-KR </td>
<td>
ISO-2022-KR, csISO2022KR </td>
</tr>
<tr valign=top>
<td>
52936 <BR>(See Note 4) </td>
<td>
Simplified Chinese (HZ) </td>
<td>
HZ-GB-2312 </td>
<td>
HZ-GB-2312 </td>
</tr>
<tr valign=top>
<td>
28594 </td>
<td>
Baltic (ISO) </td>
<td>
iso-8859-4 </td>
<td>
ISO_8859-4:1988, iso-ir-110, ISO_8859-4, ISO-8859-4, latin4, l4, csISOLatin4 </td>
</tr>
<tr valign=top>
<td>
28595 </td>
<td>
Cyrillic (ISO) </td>
<td>
iso-8859-5 </td>
<td>
ISO_8859-5:1988, iso-ir-144, ISO_8859-5, ISO-8859-5, cyrillic, csISOLatinCyrillic, csISOLatin5 </td>
</tr>
<tr valign=top>
<td>
28597 </td>
<td>
Greek (ISO) </td>
<td>
iso-8859-7 </td>
<td>
ISO_8859-7:1987, iso-ir-126, ISO_8859-7, ISO-8859-7, ELOT_928, ECMA-118, greek, greek8, csISOLatinGreek </td>
</tr>
<tr valign=top>
<td>
28599 </td>
<td>
Turkish (ISO) </td>
<td>
iso-8859-9 </td>
<td>
ISO_8859-9:1989, iso-ir-148, ISO_8859-9, ISO-8859-9, latin5, l5, csISOLatin5 </td>
</tr>
</table><br>
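<p>
As a sketch of how a row of the table might be used, assuming the identifiers in the Preferred ID column are the values intended for the CHARSET= parameter, a Japanese (Shift-JIS) document could be declared as follows. The table also lists x-sjis as an alias for this row, so that name would presumably select the same character set in Internet Explorer 4. </p>
<pre>
&lt;!-- shift_jis is the preferred ID listed for Windows code page 932 --&gt;
&lt;META HTTP-EQUIV=&quot;Content-Type&quot; CONTENT=&quot;text/html; CHARSET=shift_jis&quot;&gt;
</pre>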
<p>
<b>Notes: Source documents</b> </p>
<p>
Note 1: us-ascii, ascii, iso8859-1, iso_8859-1, iso-8859-1, ANSI_X3.4-1968, iso-ir-6, ANSI_X3.4-1986, ISO_646.irv:1991, ISO646-US, us, IBM367, cp367, csASCII, latin1, iso_8859-1:1987, iso-ir-100, ibm819, cp819, Windows-1252, x-ansi </p>
<p>
Note 2: ks_c_5601, ks_c_5601-1987, korean, csKSC56011987, euc-kr </p>
<p>
Note 3: ISO_8859-8:1988, iso-ir-138, hebrew, csISOLatinHebrew, Windows-1255, ISO_8859-8i , ISO_8859-8e, ISO-8859-8i, ISO-8859-8e , logical </p>
<p>
Note 4: http://www.internic.net/rfc/rfc1843.txt </p>
</BODY>
</HTML>
